Provably Safe Reinforcement Learning with Step-wise Violation Constraints

Neural Information Processing Systems

We name this problem Safe-RL-SW. Our step-wise violation constraint differs from the prior expected violation constraints (Wachi & Sui, 2020; Efroni et al., 2020b; Kalagarla et al., 2021) in two aspects: (i) minimizing the step-wise violation enables the agent to learn an optimal policy that avoids unsafe regions deterministically, …
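
To make the distinction concrete, here is a rough sketch in my own shorthand (the symbols below are not taken from the paper): let $c(s) \in \{0, 1\}$ flag unsafe states, $H$ the episode horizon, and $\tau$ a violation budget. The expected violation constraints cited above bound only an average over trajectories,

$$\mathbb{E}_\pi\Big[\sum_{h=1}^{H} c(s_h)\Big] \le \tau,$$

so individual runs may still pass through unsafe regions, whereas a step-wise constraint requires the violation at every step $h$ of every episode $k$ to be (near) zero, i.e. $c(s_h^k) \approx 0$ for all $h, k$. This is why minimizing the step-wise violation yields policies that avoid unsafe regions deterministically rather than merely on average.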


Safe Reinforcement Learning by Imagining the Near Future

Neural Information Processing Systems

Safe reinforcement learning is a promising path toward applying reinforcement learning algorithms to real-world problems, where suboptimal behaviors may lead to actual negative consequences. In this work, we focus on the setting where unsafe states can be avoided by planning ahead a short time into the future. In this setting, a model-based agent with a sufficiently accurate model can avoid unsafe states. We devise a model-based algorithm that heavily penalizes unsafe trajectories, and derive guarantees that our algorithm can avoid unsafe states under certain assumptions. Experiments demonstrate that our algorithm can achieve competitive rewards with fewer safety violations in several continuous control tasks.
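
As a minimal sketch of this short-horizon idea (not the authors' implementation; every name below, `model`, `policy`, `is_unsafe`, `PENALTY`, is a hypothetical placeholder): roll a learned dynamics model forward a few steps and assign a heavy penalty to any imagined trajectory that reaches an unsafe state, so planning against the penalized returns steers the agent away early.

HORIZON = 5          # how far ahead the agent "imagines"
PENALTY = -1000.0    # heavy penalty for an imagined unsafe state
GAMMA = 0.99         # discount factor

def imagined_return(model, policy, is_unsafe, state, horizon=HORIZON):
    """Score a short imagined rollout under the learned model.

    Assumed interfaces: model(s, a) -> (next_state, reward),
    policy(s) -> action, is_unsafe(s) -> bool.
    """
    total, discount = 0.0, 1.0
    s = state
    for _ in range(horizon):
        a = policy(s)
        s, r = model(s, a)
        if is_unsafe(s):
            # An imagined violation outweighs any achievable reward,
            # so the planner avoids trajectories that lead here.
            return total + discount * PENALTY
        total += discount * r
        discount *= GAMMA
    return total

Candidate actions can then be ranked by the imagined return of the rollouts they initiate; with a sufficiently accurate model, actions leading to unsafe states within the horizon are never preferred.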




Probabilistic Shielding for Safe Reinforcement Learning

Hamel-De le Court, Edwin, Belardinelli, Francesco, Goodall, Alex W.

arXiv.org Machine Learning

In real-life scenarios, a Reinforcement Learning (RL) agent aiming to maximise its reward must often also behave in a safe manner, including at training time. Thus, much attention in recent years has been given to Safe RL, where an agent aims to learn an optimal policy among all policies that satisfy a given safety constraint. However, strict safety guarantees are often provided through approaches based on linear programming, which scale poorly. In this paper we present a new, scalable method that enjoys strict formal guarantees for Safe RL, in the case where the safety dynamics of the Markov Decision Process (MDP) are known and safety is defined as an undiscounted probabilistic avoidance property. Our approach is based on state augmentation of the MDP and on the design of a shield that restricts the actions available to the agent. We show that our approach provides a strict formal guarantee that the agent stays safe at training and test time. Furthermore, we demonstrate that our approach is viable in practice through experimental evaluation.
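
As a minimal sketch of the shielding idea, under the paper's stated assumption that the safety dynamics are known (the recursion and all names below are my own illustration, not the authors' construction): compute, by backward induction over the known safety dynamics, the maximal probability of remaining safe after taking each action, then mask out actions that fall below a chosen threshold.

def safe_value_iteration(states, actions, trans, unsafe, horizon):
    """p[h][s][a] = best-case probability of avoiding `unsafe` for the
    remaining steps after taking `a` in `s` at step `h`.

    Assumed interface: trans(s, a) -> dict {next_state: probability}.
    """
    # Base case: at the horizon nothing is left to violate, except that
    # already being in an unsafe state counts as a failure.
    p = [{s: {a: (0.0 if s in unsafe else 1.0) for a in actions}
          for s in states} for _ in range(horizon + 1)]
    for h in range(horizon - 1, -1, -1):
        for s in states:
            for a in actions:
                if s in unsafe:
                    p[h][s][a] = 0.0
                    continue
                # Assume the best continuation at the next step.
                p[h][s][a] = sum(prob * max(p[h + 1][s2].values())
                                 for s2, prob in trans(s, a).items())
    return p

def shield(p, h, s, actions, threshold):
    """Restrict the agent to actions whose safety probability meets the bound."""
    allowed = [a for a in actions if p[h][s][a] >= threshold]
    return allowed or list(actions)  # fallback is an assumption, not the paper's rule

Sampling actions only from `shield(...)` is what makes the guarantee hold at training time as well, since unsafe actions are never executed in the first place.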


Enhance Exploration in Safe Reinforcement Learning with Contrastive Representation Learning

Doan, Duc Kien, Le, Bang Giang, Ta, Viet Cuong

arXiv.org Artificial Intelligence

In safe reinforcement learning, an agent needs to balance exploratory actions against safety constraints. Following this paradigm, domain-transfer approaches learn a prior Q-function from related environments to prevent unsafe actions. However, because of a large number of false positives, some safe actions are never executed, leading to inadequate exploration in sparse-reward environments. In this work, we aim to learn an efficient state representation that balances exploration and safety-preferring actions in a sparse-reward environment. First, the image input is mapped to a latent representation by an auto-encoder. A contrastive learning objective is then employed to distinguish safe and unsafe states. During learning, the latent distance is used to construct an additional safety check, which allows the agent to bias its exploration away from unsafe states. To verify the effectiveness of our method, experiments are carried out in three navigation-based MiniGrid environments. The results highlight that our method explores the environment better while maintaining a good balance between safety and efficiency.
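
As a minimal sketch of the latent safety check described above (the triplet-style loss and every name here, `z_anchor`, `unsafe_bank`, `margin`, are my own stand-ins; the paper's exact contrastive objective may differ): the contrastive term pushes latents of unsafe states away from safe ones, and at exploration time the distance from the current latent to stored unsafe latents flags risky states.

import torch
import torch.nn.functional as F

def contrastive_safety_loss(z_anchor, z_safe, z_unsafe, margin=1.0):
    """Triplet-style objective: a safe anchor latent should sit closer to
    other safe latents than to unsafe ones, by at least `margin`."""
    d_pos = F.pairwise_distance(z_anchor, z_safe)
    d_neg = F.pairwise_distance(z_anchor, z_unsafe)
    return F.relu(d_pos - d_neg + margin).mean()

def near_unsafe(z_state, unsafe_bank, threshold):
    """Safety check: flag the current state when its latent lies within
    `threshold` of any stored unsafe latent, so exploration can be biased
    away from it (hypothetical helper)."""
    dists = torch.cdist(z_state.unsqueeze(0), unsafe_bank).squeeze(0)
    return bool((dists < threshold).any())

Here the `z_*` tensors are latents produced by the auto-encoder; keeping a small bank of latents from previously visited unsafe states is one simple way to realise the check, though the paper may implement it differently.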